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Abstract: Olfactory receptors (ORs), the first dedicated molecules with 
which odorants physically interact to arouse an olfactory sensation, con¬ 
stitute the largest gene family in vertebrates. Dogs and wolves, like many 
other mammals, have a highly developed capability to detect and identify 
odorant molecules, even at minimum concentrations. In this study, the 
olfactory receptor repertoire from domestic dog and its closest relative, 
the wolf, were sequenced to estimate the fraction of pseudogenes in each 
subspecies. The fraction of disrupted olfactory receptor genes in dog was 
17.78%, whereas, that in wolf was 12.08%. As expected the dog was less 
dependent on olfaction than the wolf, and the dog had more olfactory 
receptor pseudogenes. However, the observed difference between the two 
subspecies was not at the significant level (x 2 = 1.388, p = 0.239 > 0.05). 
The values indicated that although domestication might play a role in the 
reduction of OR genes, it could not be concluded that the living envi¬ 
ronment provided by domestication lead to a significant reduction of the 
functional olfactory receptor repertoire. Furthermore, the purpose of 
domestication may also have influence on the ratio of functional olfac¬ 
tory receptor genes reduction. 
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Introduction 

Olfaction, the sense of smell, is an important way used by ani- 
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mals to communicate with environment and their companions. It 
presents in most animal species and the mechanisms are shared 
among distantly related taxonomical orders (Strausfeld and Hil¬ 
debrand 1999). Olfactory receptor (OR) genes were first discov¬ 
ered in Rattus norvegicus in 1991 by Buck and Axel (Buck and 
Axel 1991). Olfactory receptors proteins are expressed on the 
surface of the cilia of the olfactory sensory neurons lining the 
neuroepithelium in the nasal cavity and are capable of binding 
odorants. For discriminating among the millions of possible 
odorants, the mammalian olfactory system possesses a repertoire 
of approximately more than 1000 different olfactory receptors. 
All of the olfactory receptors belong to the G protein-coupled 
receptor superfamily and are evolutionarily related (Young et al. 
2002 ). 

Animal species can be subdivided depending on how critical 
the olfaction is for their survival (Scott and Fuller 1965; Gilad et 
al. 2004). It is supposed that in primates such as humans and 
apes, the senses rather than the olfaction may be more important 
than smell. This strategy is reflected in some morphological fea¬ 
tures of primates such as a relatively small nose and large frontal 
eyes (Gilad et al. 2003). And the reduced dependence on smell is 
also evident on the molecular level. Therefore, it has been shown 
that humans have an extremely reduced repertoire compared to 
the mouse or even to other primates. In humans, about 60% of 
the olfactory receptors genes carry additional disruptions in their 
coding region (Glusman et al. 2000), while this fraction is equal 
to 40% in the chimpanzee and 20% in the mouse (Rouquier et al. 
2000). One possible reason for the exceptionally high fraction of 
pseudogenes in humans is that the conservation of olfactory re¬ 
ceptors in humans was relaxed as the special human lifestyle. If 
the lifestyle is critical factor leading to this exceptionally high 
fraction of pseudogenes, other species which underwent a similar 
change in lifestyle should exhibit a similarly changed functional 
olfactory repertoire. 

In order to verify this hypothesis, in the present study the ol¬ 
factory receptor repertoire from domestic dog and its closest 
relative, the wolf, were sequenced to estimate the fraction of 
pseudogenes in each subspecies. 
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Materials and methods 

Genomic DNA extraction 

Blood sample from a female wolf was obtained from the Dalai 
Lake Nature Reserve in Inner Mongolia of China, and the other 
blood sample was obtained from a female domestic dog in 
Shandong province of China. Total genomic DNAs were ex¬ 
tracted from the blood following the method of Sambrook and 
Russell (2001). 

Degenerate primer construction 

Two sets of degenerate primers were designed on the conserved 
regions of Human OR genes 1 and 7 (HI and H7), and each set 
should amplified approximately 700 bp of the coding region. 
Primer pair PCI was designed on human Class I OR genes only, 
primer pair PC2 on human Class II with the exclusion of sub¬ 
family 7E. Both primers were 26 nucleotides long. Sequences are 
as follows: 

PC 1 -5:5 ’ -CT(GC)C A(CT)(GC) A( AG)CCC A-TGT( AT)(CT)( AC 
T)(AT)(CT)TT (GCT)CT-3 ’; 

PC 1 -3:5 ’ -GT(CT)(CT)T(GC) A(CT)( AGT)C( ACT)( AG)TA( AG) 
A(CT)(AG)A(CT)(AG)GGG TT-3’; 

PC2-5:5 ’ -(CT)T( AGCT)C A(CT)( AT)C( ACT)CC( ACT) ATGTA( 
CT)TT (CT)TT (GCT)CT-3 ’; 

PC2-3:5 ’ -TT (CT)CT (AGCT) A( AG)G(GC)T (AG)TAGAT (AGC 
T)A(AGCT) (AGT) GG(AG) TT-3’. 

The PCI pair had a total degeneracy of 1.06><10 7 , i.e.: many 
primer combinations were potentially present in the reaction 

n 

mixture. This value for PC2 is 2.12x10 . In human, primer pair 
PCI amplifies an approximately equal proportion of Class I and 
Class II OR genes. Primer pair PC2 amplifies exclusively Class 
II OR genes. To compensate for this effect the final sample was 
constructed to contain 25% of sequences amplified by primer 
pair PCI. 

Primers were designed on all the OR sequences contained in 
the HORDE database (Godfrey et al. 2004) and experimentally 
tested on the mouse and the domestic dog DNAs. For these spe¬ 
cies, the family composition and the exact fractions of pseu¬ 
dogenes were known (Zhang and Firestein 2002). 

Degenerate PCR 

PCR was performed in a total volume of 50 pL containing 0.25 
mmol of each deoxynucleotide (Promega, Madison WI (9)), 10 
pmol each of forward and of reverse primer, 25 mmol MgC12, 10 
mmol Tris pH 8.3, five units of Taq DNA polymerase and 50 ng 
of genomic DNA. PCR conditions were as follows: 5 min at 
95°C followed by 45 cycles of denaturation at 95°C for 40 s, 
annealing at 47°C for 30 s and elongation at 72°C for 2 min; 
followed by an additional elongation step at 72°C for 10 min. 
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Cloning 

PCR products were visualized on a 1.8% agarose gel and subse¬ 
quently cloned AE into chemocompetent E. coli using the 
pGEM-T Easy TA Cloning Kit (Invitrogen, Carlsbad, CA (5)) 
following the standard protocol (Version P) with 4 pL of PCR 
products and a ligation time of 30 min. 

Colony PCR 

Single bacterial white colonies where picked and suspended in 
40 pL of PCR reaction mix and amplified by touch down PCR 
using the vector primers. Conditions of the touch down PCR 
were as follows: an initial denaturation step at 94°C for 5 min, 40 
cycles of denaturation at 94°C for 30 s, annealing at 57°C for 1 
min and elongation at 72°C for 1 min, followed by a terminal 
elongation step at 72°C for 15 min. 

Sequencing 

PCR products (5 pL) of positive colony were visualized on a 
1.8% agarose gel and the remaining product was purified using 
the method of ethanol/EDTA/NaAC. Sequencing reactions were 
performed on PCR products in both directions with a 
dye-terminator cycle sequencing kit on ABI PRISM 3700 auto¬ 
mated DNA sequencer (Applied Biosystems, Foster City, CA 
(!))• 

Sequence analysis 

The base was called with the ANALYSIS software, Ver. 3.0 (Ap¬ 
plied Biosystems, Foster City, CA (1)). The sequences were vis¬ 
ually inspected and edited by SEQMAN, Ver. 5.00 (DNASTAR, 
Madison, WI (2)). Primer sequences were removed and individ¬ 
ual clones with more than 98% identity were joined into one 
consensus sequence. This cut-off ensured that Taq-generated 
mutations that might have been sequenced in individual clones 
were not counted as independent genes. Then we did blast in 
NCBI. Sequences with no best-hit to known OR genes were 
removed. 

DNAsp4 was used to identify stop codons within the se¬ 
quences obtained. If there was no interrupted frame, the se¬ 
quence was considered as a functional gene. If one or more addi¬ 
tional interrupted coding region could be found, the sequence 
was assigned as pseudogene. In some cases, the combining of 
several clones can create some ambiguous positions. We checked 
all nucleotide variants of the ambiguous sequences with the same 
set of rules. If a complete open reading frame existed only re¬ 
sulted from these ambiguous positions and the alternative would 
result in a stop codon, the sequence was removed from the 
analysis. 

The observed differences in proportions of pseudogenes be¬ 
tween wolf and dog were calculated by/ 2 test. 

Sample representation 

Degenerate primers were chosen to get an overview over the 
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domestic dog and the wolf OR repertoires. Nevertheless, using 
degenerate primers might have risk of biased amplification cer¬ 
tain genes rather than others. The result from these primers could 
not represent the familial composition of the OR repertoire. On 
the other hand, OR families contain different fractions of pseu¬ 
dogenes. Therefore, estimating the OR pseudogene fraction for 
the entire repertoire could be meaningless. A pilot project was 
carried out to test the performance of the degenerate primers 
prior to sampling OR genes from domestic dog and wolf. 

A set of degenerate primers were designed on the basis of the 
H2 (forward primer) and H7 (reverse primer) domains of all 
human OR genes according to the HORDE database, using the 
program DEFOG (Fuchs et al. 2002; Matasci 2003). The primer 
set PCI was designed on all Class I Ors, while the primer set 
PC2 was designed on all Class II OR genes. 60 OR genes were 
amplified from dog genomic DNA with both primer sets. The 
result showed that primer set PCI amplifies OR genes from 
Class I and Class II in equal proportions, and PC2 amplifies 
Class II OR genes only. It was induced that a sample composed 
of 25% of OR genes amplified by PCI and 75% of OR genes 
amplified by PC2 would reflect the familial composition of OR 
genes seen in dogs. It was confirmed that the estimation was 
correct through the analysis of a sample of 100 OR genes using 
the 25% PCl+75% PC2 rule for domestic dog. 

In an analysis of the olfactory repertoire of the dog, both a de¬ 
generate PCR based approach and a data-mining of the dog draft 
genome approach have been undertaken. Both approaches re¬ 
sulted in a similar estimate of the fraction of pseudogenes present 
in the OR repertoire (Olender et al. 2004). Therefore, according 
to the analysis, the sample would reflect the familial composition 
of the OR repertoire of the corresponding species. 

Sampling of OR genes 

The required number of at least 100 OR genes were selected 
from 700 sequenced bacterial colonies for each subspecies. The 
genes of insufficient sequence quality or read length were re¬ 
moved from our test. And if the similarity of two genes exceeded 
98%, they were considered as one gene. At last, 127 OR genes 
were got for domestic dog and 131 for wolf. 108 sequences of 
correct length were collected for the wolf, 26 from the amplifica¬ 
tion with the PCI primers and 82 from the one with the PC2. For 
dog the number of sequences for the test amplified with PCI was 
26, whereas, the number for PC2 was 81. Through this selection, 
it was confirmed that the comparison was significative. 

Results and discussions 

Estimate the fraction of OR pseudogenes 

As known, the OR genes have no introns in their coding frame. 
Pseudogenes were defined as carrying an in-frame premature 
stop codon within the coding region. Premature stop codons 
could be the result of nonsense point mutations or frameshift 
mutations caused by insertions or deletions. These ORs with a 
premature stop codon were functional, since the 700 bp region 


amplified in the study was located between the transmemebrane 
domains H2 and H7, which were considered to be of critical 
importance for the functioning of these proteins (Buck and Axel 
1991). Because of the OR genes encoded by a coding region with 
no introns, such stop codons could not be overlooked by certain 
mechanisms such as alternative splicing. On the other way, as¬ 
sessing the fraction of pseudogenes by scanning only 700 bp 
resulted in an underestimate of the real fraction of gene disrup¬ 
tions, which were likely to occur over the entire coding region. 
OR genes were aligned using the ClustalW algorithm and den¬ 
drograms were visually inspected (Fig. 1(a)). No clusters com¬ 
posed exclusively of pseudogenes could be detected. 




Fig. 1 Unrooted neighbour-joining trees based on nucleotide se¬ 
quence distances for watchdog (a) and wolf (b). Branch lines: slim 
lines, functional genes; thick lines, pseudogenes. 

The fraction of OR genes could be determined for a set of 107 
dog OR genes and 108 wolf OR genes. For dog the fraction of 
pseudogene is 17.78%, while that for wolf is 12.08%. The ob¬ 
served difference in proportions of pseudogenes is not significant 
(x =1.388, jP=0.239>0.05). 

Potential factors for the fraction of observed pseudogenes 

Some other factors rather than biological ones might also influ- 
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ence the fraction of pseudogenes observed in the two subspecies. 
As shown in the pilot study, the bias of the degenerate primers 
towards certain families might significantly influence the frac¬ 
tion of observed pseudogenes. In the study, the results of dogs 
were compared with the ones obtained by genome-mining analy¬ 
sis of this species (Matarazzo et al. 1998). The comparison 
showed that the comparison in the study between dogs and wolf 
is meaningful. And even if there was the deviation, the results 
could not be significantly affected due to the similar familial 
composition in the two subspecies. On the other hand, this prob¬ 
lem would affect both subspecies alike, resulting in two esti¬ 
mates which are equally biased. Furthermore, the cladograms 
depicted in Fig. 1(a) and (b), where no major branch exclusively 
composed of pseudogenes could be detected, provided additional 
support for the fact that the primers used in this study are unbi¬ 
ased. 

Mistakes in the DNA sequence resulted in another possible 
source of error. While incorrect base calling rarely caused stop 
codons (<5% of random codon mutations would result in a stop 
codon), erroneous insertion or deletion within the coding region 
resulted in an inferred protein sequence, which usually could not 
bear any similitude with the original one and was extremely 
likely to be truncated. For such base calling problems depending 
on the quality of the reads of the DNA sequences amplified and 
minimizing the difference between the two subspecies, all labo¬ 
ratory operations were performed in parallel. Furthermore, we 
used only good quality sequences with reads from both strands. 

In a study on OR genes from pig (Wagner and Lykke 2002), 
15 OR genes sequences were detected via amplification of cDNA 
obtained from mRNA extracted from olfactory epithelium. But 
none of these OR genes were pseudogenes. We think that OR 
genes that carry destructive mutation are already pseudogenes at 
first, and they might have already been switched off by some 
other means, possibly acting on promoters and enhancers. Muta¬ 
tions can accumulate without any effect. The proteins that origi¬ 
nally encoded by these genes are not produced anymore, even 
dramatic changes like frame shift or non-sense mutations can 
accumulate in the genome. Additionally, even if mRNA encoding 
for such non-functional genes is produced, it might be recog¬ 
nized as bad and immediately destroyed by the non-sense medi¬ 
ated RNA decay mechanism (Mombaerts 1999) Such factors 
may explain why mRNA of OR pseudogenes could not be found 
in these studies. The same phenomenon can be observed in the 
mouse. None pseudogene was found in the 55 OR genes detected 
in mouse from olfactory epithelium, while the actual value for 
the fraction of pseudogenes in mouse was 20% (Zhang and 
Firestein 2002). 

Fraction of pseudogenes 

According to the result of previous studies, 60% or more of the 
human OR genes carry one or more disruptions (Glusman et al. 
2001), on the other way, the overall number of OR genes is sim¬ 
ilar in all mammals. It is a hypothesis that the loss of functional 
OR genes is owing to a relaxation of evolutionary constraints on 
olfaction resulting from the human lifestyle. For investigating 

^ Springer 


this hypothesis further, it is to inferred whether domesticated 
species have a higher proportion of pseudogenes in their OR 
repertoire than their closest wild living relatives. The proportion 
of OR pseudogenes was estimated to find the key to this question 
in the repertoires of a domestic dog and a wolf. It was found that 
the fraction of pseudogenes defined as interruption of the in¬ 
ferred protein sequence is 17.78% in the dog, while that in the 
wolf is 12.08%, which is 5.70% lower than its closest domestic 
relative. However, this difference is not statistically significant. 
If these estimates represent the real proportions of pseudogenes 
in each subspecies, it would require a much larger sample 
(roughly 300 genes from each subspecies) in order to have 
enough power to achieve statistical significance at the 5% level. 
In this study, we does not consider genes with interrupted coding 
region are not expressed, correctly folded or transported to the 
cell surface to perform their function. These are also 
non-functional genes, which are much more difficult to identify. 
The fractions presented here, as well as in other studies, are thus 
lower estimates. 

OR familial composition in dog and wolf 

From the results of the studies (Table 1 and 2), the difference in 
the relative sizes of OR families between the two subspecies is 
relatively limited. Therefore, it is concluded that the comparison 
is meaningful. 


Table 1. OR fa mili al composition and pseudogene fraction in domes¬ 
tic dog 


Family 

OR No. 

Family 

(%) 

Pseudogenes 

No. 

Pseudogenes 

fraction 

1 

2 

1.87% 

0 

0.00% 

2 

9 

8.41% 

0 

0.00% 

3 

4 

3.74% 

0 

0.00% 

4 

8 

7.48% 

0 

0.00% 

5 

8 

7.48% 

1 

12.50% 

6 

4 

3.74% 

1 

25.00% 

7 

6 

5.61% 

1 

16.67% 

8 

14 

13.08% 

2 

14.29% 

9 

3 

2.80% 

1 

33.33% 

10 

13 

12.15% 

3 

20.08% 

11 

5 

4.67% 

4 

80.00% 

13 

7 

6.54% 

3 

42.86% 

51 

5 

4.67% 

1 

20.00% 

52 

10 

9.35% 

2 

20.00% 

56 

1 

0.93% 

0 

0.00% 

others 

8 

7.48% 

0 

0.00% 

total 

107 

100% 

19 

17.78% 


Comparison with other mammals 

The OR familial composition is similar across mammals, rodents 
and carnivores, so the figures can be compared with the values 
obtained in similar studies. Especially, since we used the same 
set of primers as in Gilad et al. (2003), we will compare the re¬ 
sults of this study with these studies. From the comparison, the 
fraction of pseudogenes in dog and wolf are lower than that in 
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other mammals (Fig. 2). These data might supply some molecu¬ 
lar basis of canine prominent chemosensory capacities. 


Table 2. OR fa mili al composition and pseudogene fraction in wolf 


Family 

OR 

No. 

Family 

(%) 

Pseudogenes 

No. 

Pseudogenes 

fraction 

1 

2 

1.85% 

0 

0.00% 

2 

11 

10.19% 

1 

9.09% 

3 

1 

0.93% 

0 

0.00% 

4 

4 

3.70% 

0 

0.00% 

5 

6 

5.56% 

1 

16.67% 

6 

0 

0.00% 

0 

0.00% 

7 

16 

14.81% 

0 

0.00% 

8 

13 

12.04% 

0 

0.00% 

9 

5 

4.63% 

0 

0.00% 

10 

10 

9.26% 

1 

10.00% 

11 

4 

3.70% 

2 

50.00% 

13 

7 

6.49% 

3 

42.86% 

51 

6 

5.56% 

0 

0.00% 

52 

6 

5.56% 

1 

16.67% 

56 

1 

0.93% 

1 

100.00% 

others 

16 

14.81% 

3 

18.75% 

total 

108 

100% 

13 

12.08% 


Domestication, a possible reason for the functional ORs genes 
reduction 

The difference between dogs and wolf, although not significant, 
goes in the direction expected if domestication was the causes. 
Dogs appear to have indeed a smaller functional OR repertoire 
than wolf. In a domestic environment, dogs rely on humans es¬ 
pecially for activities, while olfaction plays a critical role in the 
wild subspecies, such as food, friend location and other social 
interactions. In such a case, OR genes are lost by mutation and 
random drift. For example, random mutations can accumulate to 
lead to the inactivation of the gene, and the mutated gene can 
increase its frequency in the population. However, mutations that 
do not affect the fitness usually require long time to spread with¬ 
in the population and the reach fixation. The time elapsed since 
domestication (15 000-17 000 years) (Savolainen et al. 2002) can 
hardly account for such an effect. 

On the other hand, the loss of OR could be the result of an ac¬ 
tive selective process. In contrast to neutral traits, traits under 
selection get fixed rapidly, as few as 10 generations can suffice, 
and domestication provides very strong selection. An example of 
such phenomena is the connection between agouti coat color and 
tameness observed in several mammals (Price 2002). 

The changes in nasal airways might be an adaptive change as a 
result of short time domestication. It is the strengthening hy¬ 
pothesis that olfaction, which is especially apparent between pig 
and wild boar (Olender et al. 2004), might have been under some 
sort of selection during domestication (Price 2002). 

However, there was no significant difference to conclude that 
the living environment provided by domestication played a major 
role in the reduction of the functional olfactory receptor reper¬ 


toire. Therefore, the purpose of domestication may also play a 
key role on dogs’ olfaction. Olfaction also plays a key role in the 
behavior of dogs and in its social life. The reduction of dogs’ OR 
genes is not apparent. 
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Different mammals 


Fig. 2 Comparison of the fraction of pseudogenes in different mam¬ 
mals (Godfrey et al. 2004; Olender et al. 2004; Rouquier et al. 2000) 


Conclusion 

According to the present results and other researches carried out 
previously, it is inferred that domestication might lead to the 
reduction of OR genes at a certain extent, while the purpose of 
domestication also plays an important role on the selection of 
dogs’ olfaction. During the course of evolution, there were vari¬ 
ous factors influencing the change of olfaction, and therefore, 
much more studies have to be carried out to explain the evolu¬ 
tional meaning from wolf to dog. 
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